Z-Glyph: Visualizing outliers in multivariate data
نویسندگان
چکیده
Outlier analysis techniques are extensively used in many domains such as intrusion detection. Today, even with the most advanced statistical learning techniques, human judgment still plays an important role in outlier analysis tasks due to the difficulty of defining and collecting outlier examples. This work seeks to tackle this problem by introducing a new visualization design, ‘‘Z-Glyph,’’ a family of glyphs designed to facilitate human judgment in outlier analysis of multivariate data. By employing a location-scale transformation, a ZGlyph represents the ‘‘normal’’ data using regular shapes (e.g. straight line and circle), such that the abnormal data can be revealed when deviating from the regular shapes. Extensive controlled experiment and case studies based on real-world datasets indicate the superior performance of the Z-Glyph family, compared with the baselines, suggesting that the proposed design is able to leverage human perceptional features with statistical characterization. This study contributes to a more fundamental understanding about designing visual representations for revealing outliers in multivariate data, which can be applied as a building block in many domain-specific anomaly detection applications.
منابع مشابه
Leaf Glyph - Visualizing Multi-dimensional Data with Environmental Cues
In exploratory data analysis, important analysis tasks include the assessment of similarity of data points, labeling of outliers, identifying and relating groups in data, and more generally, the detection of patterns. Specifically, for large data sets, such tasks may be effectively addressed by glyph-based visualizations. Appropriately defined glyph designs and layouts may represent collections...
متن کاملSurvey of glyph-based visualization techniques for spatial multivariate medical data
In this survey article, we review glyph-based visualization techniques which have been exploited when visualizing spatial multivariate medical data. To classify these techniques, we derive a taxonomy of glyph properties that is based on classification concepts established in information visualization. By considering both the glyph visualization as well as the interaction techniques that are emp...
متن کاملLeaf Glyphs: Story Telling and Data Analysis Using Environmental Data Glyph Metaphors
In exploratory data analysis, important analysis tasks include the assessment of similarity of data points, labeling of outliers, identifying and relating groups in data, and more generally, the detection of patterns. Specifically, for large data sets, such tasks may be effectively addressed by glyph-based visualizations. Appropriately defined glyph designs and layouts may represent collections...
متن کاملVisual Exploration of Time-Series Data with Shape Space Projections
Time-series data is a common target for visual analytics, as they appear in a wide range of application domains. Typical tasks in analyzing time-series data include identifying cyclic behavior, outliers, trends, and periods of time that share distinctive shape characteristics. Many methods for visualizing time series data exist, generally mapping the data values to positions or colors. While ea...
متن کاملVisualization of Uncertain Multivariate 3D Scalar Fields
DAVID FENG: Visualization of Uncertain Multivariate 3D Scalar Fields (Under the direction of Russell M. Taylor II) This dissertation presents a visualization system that enables the exploration of relationships in multivariate scalar volume data with statistically uncertain values. The nDimensional Volume Explorer (nDive) creates multiple visualizations that emphasize different features of the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017